Audio Classification and Retrieval by Using Vector Quantization

نویسندگان

Shruti Vaidya

Kamal Shah

چکیده

In today’s world, we can say that information and its processing has become the critical aspect for functioning of everything. In the early days, information was generally obtained and processed in the form of text. Today information is available in all forms namely, text, music, graphics, etc. which are a easily understandable and accurately represent information. Information is first captured then the captured information is retrieved and analyzed for further requirements. In this paper, the information that we take into consideration is in audio form. We have studied the feature vector extraction methods, similarity measurement techniques, and have also measured the performance parameters. It has been observed that the use of multiple feature vectors provides better and more accurate classification and retrieval of audios from large database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Tree-based Quantization for Content Based Music Retrieval System

In this paper, we have proposed and implemented a new music retrieval system based on content of music wave files. We have investigated different quantization methods by constructing them into the music data histograms as the feature vectors for the music files. There are three important aspects that will affect implementation of the system: audio feature extraction, quantization and distance c...

متن کامل

A Similarity Measure for Automatic Audio Classification

This paper presents recent results using statistics generated by a MMl-supervised vector quantizer as a measure of audio similarity. Such a measure has proved successful for talker identification, and the extension from speech to general audio, such as music, is straightforward. A classifier that distinguishes speech from music and non-vocal sounds is presented, as well as experimental results ...

متن کامل

Content-based methods for the management of digital music

The literature on content-based music retrieval has largely finessed acoustic issues by using MIDI format music. This paper however considers content-based classification and retrieval of a typical (MPEG layer III) digital music archive. Two statistical techniques are investigated and appraised. Gaussian Mixture Modelling performs well with an accuracy of 92% on a music classification task. A T...

متن کامل

A high speed unsupervised speaker retrieval using vector quantization and second-order statistics

This paper describes an effective unsupervised method for query-by-example speaker retrieval. We suppose that only one speaker is in each audio file or in audio segment. The audio data are modeled using a common universal codebook. The codebook is based on bag-of-frames (BOF). The features corresponding to the audio frames are extracted from all audio files. These features are grouped into clus...

متن کامل

Latent topic model for audio retrieval

Latent topic model such as Latent Dirichlet Allocation (LDA) has been designed for text processing and has also demonstrated success in the task of audio related processing. The main idea behind LDA assumes that the words of each document arise from a mixture of topics, each of which is a multinomial distribution over the vocabulary. When applying the original LDA to process continuous data, th...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2014

Audio Classification and Retrieval by Using Vector Quantization

نویسندگان

چکیده

منابع مشابه

An Efficient Tree-based Quantization for Content Based Music Retrieval System

A Similarity Measure for Automatic Audio Classification

Content-based methods for the management of digital music

A high speed unsupervised speaker retrieval using vector quantization and second-order statistics

Latent topic model for audio retrieval

عنوان ژورنال:

اشتراک گذاری